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Abstract 


A  quasi-maximum  likelihood  estimator  of  the  break  date  is  analyzed.  Consistency  of 
the  estimator  is  demonstrated  under  very  general  conditions,  provided  that  the  data- 
generating  process  is  not  integrated.  However,  the  asymptotic  distribution  of  the  estimator 
is  quite  different  for  time  series  that  are  integrated  of  order  one.  In  that  case,  when  there 
is  no  break,  the  analyst  can  be  spuriously  led  to  the  estimation  of  a  break  near  the  middle 
of  the  time  series. 


1      Introduction 

The  importance  of  considering  structural  change  in  statistical  models  is  well  documented 
in  the  literature  (see  the  surveys  by  Zacks  (1983)  and  Krishnaiah  &  Miao  (1988)).  The 
problem  is  to  test  if  a  change  in  the  parameters  of  the  model  has  occured  and,  if  so, 
to  estimate  when  and  by  how  much.  The  problem  of  estimation  has  been  studied  for 
several  models  with  different  estimation  techniques  (see  Quandt  (1958),  Hinkley  (1970), 
Yao  (1987),  Miao  (1989),  Bai  (1991)  among  others).  Also  several  tests  have  been  proposed 
treating  the  break  date  as  unknown  (see  Quandt  (1960),  Brown,  Durbin  &  Evans  (1975), 
Hawkins  (1987)  among  others  for  iid  observations  or  simple  regression  models,  and  An- 
drews (1990),  Hansen  (1992),  Chu  &  White  (1992)  among  others  for  more  general  models). 
However,  Chu  &  White  (1992)  show  that  when  the  series  is  generated  by  an  integrated 
process  of  order  one  (1(1)),  i.e.  a  process  with  one  unit  autoregressive  root,  without  struc- 
tural change,  the  test  procedures  will  lead  to  the  rejection  of  the  null  hypothesis  of  no 
structural  change  far  too  often. 

A  similar  problem  occurs  when  testing  for  unit  roots  in  economic  time-series  where  it 
is  necessary  to  know  in  advance  if  there  has  been  a  structural  change  or  not.  Perron  (1989) 
shows  that  standard  tests  of  the  unit  root  hypothesis  against  trend-stationary  alternatives 
reject  the  unit  root  hypothesis  too  infrequently  if  the  true  data  generating  process  (DGP) 
is  that  of  stationary  fluctuations  around  a  trend  function  which  contains  a  one-time  break. 

These  dual  problems  occur  because  a  time-series  generated  by  an  1(1)  process"  is  very 
difficult  to  distinguish  from  one  generated  by  a  stationary  process  (1(0))  with  structural 
change.  This  is  well  illustrated  in  Hendry  &  Neale  (1991). 

In  this  paper  we  concentrate  on  the  estimation  of  the  break  date.  We  show  that 
the  quasi- maximum-likelihood  estimator  for  the  break  date  is  consistent  under  very  weak 
assumptions  on  the  error  process  provided  it  is  not  integrated  of  positive  order.  This 
extends  previous  theorems  which  require  independence  and  normality  of  the  errors,  e.g., 
Krishnaiah  &  Miao  (1988).  We  also  allow  for  trending  regressors. 

There  is  no  study  of  the  effect  of  integrated  processes  on  the  distribution  of  the  estima- 
tor of  the  break  date.  In  this  paper  we  also  derive  the  asymptotic  distribution  of  the  break 
date  quasi-maximum-likelihood  estimator  in  this  case.   We  show  that  when  a  variable  is 


1 


generated  by  an  1(1)  process  without  any  structural  change,  the  estimation  of  a  model  with 
structural  change  will  suggest  a  spurious  break.  This  result  is  analogous  to  the  spurious 
regression  and  spurious  trend  results  of  Granger  &  Newbold  (1974),  Phillips  (1986)  and 
Durlauf  k  Phillips  (1988)  and  complements  the  findings  of  Chu  &  White  (1992). 

In  section  2  we  introduce  a  model  of  structural  change,  and  the  quasi-maximum- 
likelihood  estimator  of  the  break  date.  Asymptotic  distributional  results  for  the  case 
where  the  error  terms  are  not  integrated  are  given  in  section  3.  Section  4  derives  results 
for  the  case  where  the  error  terms  are  integrated  of  order  one.  In  section  5  we  report 
simulation  evidence  confirming  that  our  theoretical  results  provide  useful  insights  into 
what  would  be  found  in  the  analysis  of  samples  of  moderate  size.  Proofs  of  the  main 
theorems  are  given  in  the  appendix. 

2      Structural  Change  Model 

Given  observations  {yt,x't},  where  xt  is  p  x  1,  suppose  we  model  the  data  generating 
process  (DGP)  of  yt  as 

yt  =  x,t0t  +  eu  (1) 

where  {et}  is  an  1(0)  stochastic  process.  Consider  a  general  change  function  for  0t  given 
by 

Pt  =  P  +  9(T)(t/T),      <  =  l,2,-.-,T,  (2) 

where  g^T\X)  is  a  function  of  A  €  [0, 1]  and  may  depend  on  the  sample  size  T.  With  the 
prior  belief  that  there  is  a  single  structural  change  occuring  at  some  unknown  point  ko 
the  model  to  be  estimated  can  be  written  as: 
I    x't0i  +€t,    t=l,---,k0, 

This  corresponds  to  (3  =  f}\  and  g^T\\)  =  {fa  -  /3i)1{a>a0}  in  (2)'  wnere  1  denotes  the 
indicator  function.  The  quasi-log- likelihood  function  is  given  by 

-  \  In  2*  -   |  In  a\  -   ±  f|>  -  x^)2  +    |>t-a^2)2J  . 


Conditional  on  k,  the  quasi-maximum-likelihood  estimators  (QMLE)  of  fi\,  02,  and  of  are 


and  d2t{k)  =  RSST(k)/T,  where 

RSST(*)  =    [  J>,  ~  *5&(*))a  +     £  (»«  "  *'A(*))2  ]  •  (4) 

\t=i  «=Jfc+i  / 

The  concentrated  quasi-log-likelihood  function  is  thus 

T  T        T 

-  —  In  2tt  -   —   -   -ln<rt2(ib). 
2  2         2        tV   ; 

The  QMLE  of  A;  is  the  integer  k-p  which  solves 

min     tff(fc)         or  min     RSSr(k)-  (5) 

p<k<T—p  p<k<T—p 

Also  define 

At  =  min{A  :  A  =  argmintt€r»  ji  RSStQTu])}  (6) 

where  A  and  A  are  some  pre-specified  constants  between  0  and  1,  and  [Tu]  denotes  the 
integer  part  of  Tit.  A  common  choice  for  [A,  A]  is  [.15,  .85].  The  reason  for  this  trimming 
is  explained  for  example  in  Andrews  (1990)  and  Perron  &  Vogelsang  (1992). 

The  following  condition  will  be  assumed  to  hold  in  the  rest  of  the  paper.  Let  Dt  be 
a  p  x  p  diagonal  matrix  with  diagonal  elements  rf,r,  where  d<T  are  (possibly  different) 
powers  of  T  and  d~j  — »  0.  This  matrix  is  needed  to  normalize  xt  properly  because  the 
components  of  xt  may  be  of  different  orders  of  magnitude  in  t.  The  condition  is  given  by: 

[Al]  £>71/2  Y^=\  xtx'tD^1/2  -£+  Q(A)  uniformly  in  A  €  (0, 1],  for  some  £>T,  where  Q(X)  is 
positive-definite,  symmetric,  and  an  absolutely  continuous,  monotonically  increasing 
function  of  A;  that  is,  Q(X2)  -  Q(^i)  is  nonnegative-definite  for  all  A2  >  A^ 

Condition  [Al]  is  a  law  of  large  numbers  type  of  condition.  It  can  be  seen  that  [Al]  is 
satisfied  in  the  following  leading  examples. 

Examples: 


1.  Change  in  mean:  xt  is  the  constant  one. 

Clearly,  [Al]  is  satisfied  with  DT  =  T  and  Q(X)  =  X. 

2.  Change  in  trend:  xt  —  (1,  t)' . 
[Al]  is  satisfied  with 


DT   = 


T     0 
0     T3 


QW  = 


X       A2/2 
X2/2    A3/3 


3.  Change  in  regression  coefficients:  xt  is  a  p  x  1  random  vector. 
Under  standard  regularity  conditions, 


j      [TX] 

m^Xtx[ 


Q 


uniformly  in  A  6  (0, 1],  with  Q  a  positive-definite  matrix.    It  follows  that  [Al]  is 
satisfied  with  Dt  =  T Ip  and  Q(X)  =  XQ. 


3      Consistency 

Under  the  assumption  that  the  €t  are  independent  and  normally  distributed,  it  has  already 
been  shown  that  the  QMLE  X?  is  consistent  when  there  is  a  single  structural  change  at 
some  unknown  A  =  A0  €  (0,1),  e.g.,  Krishnaiah  &  Miao  (1988).  However,  this  result  is 
based  on  too  restrictive  assumptions.  Under  more  general  conditions,  we  show  that  the 
break  date  estimator  A^  converges  in  probability  to  the  true  break  date  A0  when  there 
is  a  single  structural  change  at  some  A  =  Ao  €  (0, 1),  or  to  the  set  {0, 1}  if  there  is  no 
structural  change. 

Consider  the  following  condition: 

[A2]  <7^(A)  is  a  function  of  bounded  variation  on  A  €  [0, 1],  which  may  depend  on  the 
sample  size  T,  such  that  for  Dt  in  [Al], 


T-b'*DT'2gVX\)->g*(X), 


uniformly  in  A  6  [0, 1]  for  some  function  g"  on  [0, 1]  and  some  constant  b  >  0. 


Condition  [A2]  characterizes  g*  as  the  limiting  behavior  of  g^;  in  particular,  if 
g(T>(\)  =  g(X)  then  <7*(A)  =  Fpg(X),  where  Fp  is  a  selection  matrix.  It  can  be  seen 
that  [A2]  is  satisfied  in  the  following  examples  where  a  single  change  has  occured  at  some 
date  ko  =  [TXo]  and  Ao  €  (0, 1)  is  a  constant. 

Examples: 

1.  Change  in  mean: 

Since  g^T\X)  =  ^1{a>a0}  =  #(^)>  [A2]  is  satisfied  with  6=1  and  g*(X)  =  <7(A). 

2.  Change  in  trend: 

(a)  Broken  Intercept:  yt  =  fa  +  <M{t>jfc0+i}  +  fat. 

Since  g^T\X)  =  (<$i,  0)'1{a>a0}  =  flO)  tnen  similar  to  the  first  example,  [A2]  is 
satisfied  with  6=1  and  <7*(A)  =  <7(A). 

(b)  Broken  Discontinuous  Trend:  yt  =  fa  +  ^il{t>fc0+i}  +  fat  +  ^2l{t>jfc0+i}2- 
Here,  <7^(A)  =  (tfj,  ^2)'1{a>Ao}>  an(^  [A2]  is  satisfied  with  6  =  3  and 

<7*(A)  =  (0,  *2)/1{a>a0}- 

(c)  Broken  but  Continuous  Trend:  yt  =  fa  +  fat  +  ^2^{t>k0+i}(t  ~  ^o)- 

In  this  case,  g^T\X)  =  (—^[TXo],  <$2)'1{a>a0}i  an(^  [A2]  is  satisfied  with  6  =  3 
and 

9*(*)  =  (-<Mo,  ^2)'1{a>a0}- 

3.  Change  in  regression  coefficients: 

Since  <^T)(A)  =  <51{a>a0}  =  <7(A),  [A2]  is  satisfied  with  6  =  1  and  ^"(A)  =  <7(A). 

Consider  now  the  following  condition: 

[A3]  For  DT  in  [Al], 

[T\]  \ 

DTl/2  £  Xi^  °  ^  A  <  H  =►  R, 

where  R  is  a  p-dimensional  Gaussian  process  on  [0, 1]  with  72(0)  =  0,  and  having 
mean  0  and  covariance 

f;[i2(A1)/2(A2)']  =  E(min(A1,A2)), 


with 


E(A)  =   lim  -E 

T  — »oo  1 


<[T\] 


<[T\] 


t=l 


«=1 


positive-definite  for  all  A. 

This  condition  is  general  enough  to  allow  €t  and  xt  to  be  weakly  dependent,  heterogeneous 
random  variables  but  not  integrated  of  positive  order.  See  for  example  Wooldridge  & 
White  (1988). 

Examples: 

1.  Change  in  mean:   suppose  that  et  satisfies  the  regularity  conditions  in  Phillips  & 
Perron  (1988).  Define  the  parameter 


a2  —   lim 
T-*oo 


H£«) 


>   0. 


Then 


#(A)  =  (tW(\),         E(A)  =  <r2A, 

with  W  the  standard  1-dimensional  Brownian  motion. 
2.  Change  in  trend:  under  the  conditions  of  the  previous  example, 

R{X)  =  a  f\l,r)'dW(r), 
Jo 

with  W  the  standard  1-dimensional  Brownian  motion.  The  variance  of  R(X)  is 
S(A)     =     var[fl(A)] 

■A 


=     a2  I  (l,r)'(l,  r)dr 
Jo 

Jo 


1      r 


r     r 


=    cr2Q{\). 


3.  Change  in  regression  coefficients:  suppose  now  that  €t  and  xt  satisfy  the  assumptions 
in  Ploberger,  Kramer  &  Kontrus  (1989).  Then 

R(X)  =  cxQV2W{\),         E(A)  =  a2XQ, 

with  W  the  standard  p-dimensional  Brownian  motion. 

Theorem  3.1  Suppose  the  DGP  for  yt  is  given  by  (1)  and  (2),  and  that  [Al]  and  [A3] 
are  satisfied. 

1.  If  [A2]  holds  with  g*(X)  =  6l{\>\0}  and  ^  €  IRP  \  {0},  and  0  <  X  <  XQ  <X  <  I  then 
Xt  -^  Ao  as  T  — »  oo, 

2.  If  there  is  no  change,  i.e.  g^T\X)  =  0,  and  X  =  0  and  X  =  1  then  Xt  — — »  {0, 1}  as 

T  —*  oo. 

When  there  is  no  structural  change,  one  way  to  have  consistency  of  Xt  m  the  sense  of 
part  2  of  Theorem  3.1  is  to  allow  fcj  to  take  all  possible  values  in  the  set  {p,...,T  —  p] 
as  in  (5).  If  this  set  is  restricted  as  in  (6)  then  consistency  is  not  possible. 

4      Spurious  Break:  yt  is  Integrated  of  Order  One 

Suppose  now  that  the  true  DGP  is  characterized  by  no  structural  change,  but  et  is  1(1), 
violating  condition  [A3].  We  show  that  when  estimating  a  model  with  structural  change, 
the  QMLE  estimator  Arj  will  suggest  a  spurious  break.  This  is  shown  in  Theorem  4.1,  as 
well  in  the  simulations  in  section  5. 

Suppose  then  that  we  estimate  model  (3)  as  in  section  2  and  that  [Al]  continues  to 
hold  for  xt.  However  yt  is  generated  by  an  1(1)  process  and  there  is  no  structural  change. 
Instead  of  (1)  and  (2)  the  DGP  for  yt  is  given  by: 

yt    =    yt-\  +  vt,         yt  =  o, 

with  {nt}  an  1(0)  stochastic  process  such  that  the  following  condition  holds: 


[A3']  For  Dt  in  [Al]  and  some  constant  a  >  0, 

[T\]  \ 

T-a/2D-H2  £  Xtyt,  0  <  A  <  1      =>  G, 

where  G  is  some  p-dimensional  functional  of  a  Gaussian  process. 

The  main  difference  between  condition  [A3]  and  [A3']  is  the  rate  of  convergence  in  the 
term  T~al2. 

Examples: 

1.  Change  in  Mean:  [A3']  holds  with  a  =  2  and 


Vt   =>   G(X)  :=   a  / 
2.  Change  in  Trend:  again  a  =  2  and 


T-^Tvt   =>   G(X)  :=   a  /    VT(r)(fr. 


1  W{r)dr. 


G(A)  :=  ./;  ( 


3.  Regression:  in  this  case  [A3']  holds  with  a  —  1.  Let  xt  and  ^  be  independent,  and 
w't  =  (x't,  T]t)'  satisfy  a  multivariate  invariance  principle  as  in  Park  &  Phillips  (1988). 
More  specifically  T-1/2  XZy=i  wj  =*"  #(^)  a  (P  +  1)- vector  Brownian  motion  B'  = 
(B„  5,)'.  Then 


T-^Xey,   =>  G(A)  :=    /    B„dfl 


Theorem  4.1   Suppose  the  DGP  /or  t/*  ts  given  by  (7),  [Al]  and  [A3']  are  satisfied,  and 
0<A<A0<A<1.  Then 

AT    =>     argmaxA€[A  X]  G(A)'g(A)"1G(A) 

+  [G(l)  -  G(A)]' [Q(l)  -  Q(A)]"1  [G(l)  -  G(A)] . 


Remark  1:  the  theorem  says  that  \?  has  an  asymptotic  distribution  with  support  equal 
to  [A,  A].  It  is  possible  that  the  support  could  be  enlarged  to  [0, 1]  but  the  proof  of  this 
would  require  deeper  mathematical  results.  The  simulation  results  which  will  be  discussed 
in  section  5  suggest  that  the  spurious  break  holds  for  [0, 1]. 

Remark  2:  suppose  that  xt  =  (x\t,...,xvt)'.  Condition  [A3']  can  be  written  as: 
[TX]  \ 

T-a'/2d,T     £x'^'  0  <  A  <  1 J  =>G„      i=l, ...,p, 

with  a, ;  =  a  >  0.  If  however  the  a,  are  not  all  the  same,  then  let  a,  =  maxi<t-<pat-, 
and  denote  by  G*  the  vector  of  elements  of  G  =  (G,,  i  =  1,  •  •  -,p)'  for  which  a,  =  a.. 
Similarly  define  Q*{\)  as  a  submatrix  of  Q{\).  Then  Theorem  4.1  holds  with  G  and  Q(A) 
replaced  by  G,  and  Q,(A)  respectively. 


5      Simulations 

In  this  section  we  discuss  some  simulations.  The  model  used  in  the  simulations  is  the 
change  in  mean  model  with  autocor related  disturbances.  We  do  not  report  our  fur- 
ther results  for  the  change  in  trend  models,  nor  for  the  model  employed  by  Ploberger 
h  Kramer  (1992)  and  Andrews  (1990)  since  all  the  results  were  qualitatively  identical. 

In  the  simulations  we  have  included  all  possible  break  points  as  in  (5).  The  number  of 
replications  for  each  experiment  was  100,000  and  we  have  used  the  normal  pseudo-random 
number  generator  in  GAUSS-386  for  iid  N(0, 1)  innovations. 

The  model  simulated  is  given  by: 

Vt  =  fit  +  ^, 

fit  =  *l{t>[TA]+l}i 

€t  =  <f>€t-l  +  Vt, 

T]t  iid  #(0,1). 

The  number  of  observations  in  each  experiment  was  T  €  {25,  50,  100}. 

Figure  1  shows  the  results  for  a  change,  6  =  1,  in  the  middle  of  the  sample,  A  =  0.5, 
for  4>  6   {0,    0.5}.    As  T  increases,  the  estimator  becomes  more  precise.     When  some 


autocorrelation  is  present,  <f>  =  0.5,  the  estimator  loses  precision  when  compared  with  the 
case  of  no  autocorrelation,  <f>  =  0,  and  for  each  sample  size  considered. 

In  Figure  2,  we  let  the  change  happen  nearer  the  beginning  of  the  sample,  A  =  0.25. 
The  same  comments  made  for  Figure  1  apply  here.  When  the  two  Figures  are  compared, 
we  see  that  the  mode  of  the  distribution  is  shifted  to  the  new  break  date. 

The  case  of  no  change,  6  =  0,  and  1(0)  errors,  <f>  6  {0,  0.5},  is  given  in  Figure  3.  We 
see  that  the  mass  of  the  distribution  is  more  concentrated  in  the  tails  than  in  the  middle. 
Also,  as  T  increases,  the  mass  tends  to  become  more  concentrated  in  the  tails. 

Finally,  in  Figure  4,  consider  the  spurious  break  case:  no  change,  6  =  0,  but  1(1)  errors, 
<f>  =  1.  As  opposed  to  Figure  3,  and  more  like  Figure  1,  the  mass  of  the  distribution  is  not 
concentrated  in  the  tails  and  is  more  concentrated  in  the  middle.  The  contrast  between 
Figures  1-3  and  Figure  4,  clearly  suggests  what  we  mean  by  a  spurious  break.  We  have 
also  simulated  the  asymptotic  distribution,  T  =  oo,  in  this  case.  The  density  estimate 
was  approximated  using  T  =  1000  and  100,000  replications,  and  smoothing  the  resulting 
empirical  density  function  with  a  Gaussian  kernel  with  bandwidth  equal  to  0.2.  The 
distribution  seems  to  be  very  stable  for  each  value  of  T  €  {25,50, 100,  oo}. 

In  conclusion,  the  simulation  results  support  the  asymptotic  results  of  sections  3  and  4. 
Moreover,  they  suggest  that  the  insights  provided  by  the  asymptotics  are  relevant  for  quite 
small  sample  sizes. 

6      Conclusion 

In  this  paper  we  have  shown  that  the  quasi-maximum-likelihood  break  date  estimator  is 
consistent  when  the  error  term  satisfies  a  condition  which  allows  for  general  error  processes 
not  integrated  of  positive  order.  We  also  obtained  the  asymptotic  distribution  of  the  break 
date  estimator  when  the  series  is  generated  by  an  1(1)  process.  In  this  case,  estimation  of 
a  structural  change  model  will  result  in  the  appearance  of  a  spurious  break. 

An  important  question  that  remains  without  an  answer  is  the  problem  of  estimating 
and/or  testing  simultaneously  for  both  unit  roots  and  structural  change.  Our  results,  to- 
gether with  some  of  those  cited  in  the  introduction,  suggest  that  disentangling  information 
in  data  about  these  two  possibilities  is  likely  to  be  a  formidable  task.  This  is  essentially 
the  graphical  insight  suggested  by  Hendry  &  Neale  (1991). 
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Appendix:  Mathematical  Proofs 


Lemma  A.l   Given  [Al]  and  [A2], 
[TX] 


(8) 


and 


T 

T-^Dr"2      Y.     ^<Sm(^)^J'dQ(r)g-(r)  (9) 


t=[T\]+\ 
as  T  — »  oo,  uniformly  in  A  €  (0,  1). 

Proof  of  Lemma  A.l: 

[TA] 


fXdQ(r)g*(r). 

Jo 


•A 


The  last  line  follows  from  [Al],  [A2]  and  Lemma  4  equation  (28)  in  Kramer,  Ploberger  &: 
Alt  (1988).  This  proves  (8).  The  proof  of  (9)  is  similar.       □ 

Lemma  A. 2   //  the  true  DGP  for  yt  is  given  by  (I)  and  (2)  then 

AT  =  min{A  :  A  =  argmaxu€[Al]MT([Tu])},  (10) 

where 

k  k  k 

MT(k)  =  (E«)(E^0_1(^x^) 

t=\  t=l  t=l 

+  (  E  «?*{)(  E  x^Y\  E  **«?)  (") 

t=fc+l  t=fc+l  t=k+l 

and 

<  =  ^  +  «iBrCr>(*).  (12) 
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Proof  of  Lemma  A. 2:     The  DGP  for  yt  can  be  rewritten  as 


yt  =  x't(3  +  (l 

From  (13)  it  is  easy  to  derive  that 
T 


(13) 


(14) 


Since  Ylt=i  €t2  does  not  depend  on  k,  the  estimator  for  k  will  be  the  integer  kj-  which 
solves: 

max     Mj(k).     D 

p<k<T— p 

Proof  of  part  1  of  Theorem  3.1:     From  (11)  and  (12)  we  have 

MT(k)  =  Mi-(k)  +  M$(k), 
where 

Mm  =  (f>;  +  jn(L)  V,)(X>*ir(X><  +  x'x'amQ) 

and 


t=i 


M}(k)  =  {  e  ...i+^^^ijt  E  *.*0_,(  E  ^  +  ^;9(T,(?)) 

t=Jfc+l  t=k+\  t=k+\ 

Using  [Al],  [A2]  and  [A3]  it  follows  after  some  manipulations  and  Lemma  A.l  that 

T-bM'T{[T\))^Lx{\) 
and 

T-bM±([T\})  JU  L2(X) 
as  T  — »•  oo,  uniformly  in  A  €  (0, 1),  where 


Ii(A)    = 


L2(A)     = 


\XdQ{r)g\r) 
Jo 

j'dQ(r)g*(r) 


QW 


-i 


fXdQ(r)g*(r) 
Jo 


[QM-QW] 


-i 


<*Q(r)<7*(r) 
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Then  we  have 

T-bMT([TX])  JU  Lx(A)  +  L2(A).  (15) 

Using  the  definition  of  <7*(A), 

L1(A)+L2(A) 


[(Q(l)  -  Q(X0))6]'[Q(l)  -  Q(X)}-1  [(Q(l)  -  Q(X0))S] ,        A  <  A 


[(Q(A)  -  Q(X0))S]'Q(X)-i  [(Q(X)  -  Q(X0))6) 

+  [(Q(l)  -  Q(\))6)'  [Q(l)  -  Q(A)]"1  [(Q(l)  -  Q(A))f] ,    A  >  A0 

*'  ([£(1)  -  Q(Ao)]  [Q(l)  -  Q(A)]"1  [Q(l)  -  Q(A0)])  6,    A  <  A0 
*'  (00)  "  QM  -  Q(Ao)  +  Q(Ao)g(A)-lQ(A0))  *,         A  >  A0 
=     /(A). 

Since  by  [Al]  Q(A2)  -  Q(X\)  is  nonnegative-definite  for  all  A2  >  Aj,  it  is  easy  to  see  that 
a  maximum  of  /(A)  occurs  exactly  at  A  =  Ao  for  any  6  ^  0.  Since  the  convergence  in  (15) 
is  uniform  in  A  6  (0, 1),  this  proves  the  Theorem.       □ 

Proof  of  part  2  of  Theorem  3.1:  Since  there  is  no  change,  e*  =  et  in  (12).  So  from 
(11)  we  have 

MT(k) 

k  k  k 


t=\     t=\       t=i 


T  T  _j        T 


<=i  «=i  <=i 

+  (  f;  <,*;V/2)(^,/2  £  ,„W">)-'(d;./>  £  «.«.) 


t=Jfc+l  <  =  ik+l  t=Jfc+l 

From  [Al]  and  [A3]  and  applying  the  continuous  mapping  theorem  the  following  holds  for 
0<  A  <  1: 

MT([TX])    =>     M{X) 

=     RiXYQiX)-1  R(X) 

+  [/2(1)  -  ^(A)]'  [Q(l)  -  Q(A)]-1  [72(1)  -  ,R(A)] .  (16) 
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From  the  assumption  of  absolute  continuity  in  [Al],  Q(X)  can  be  written  as 

Q(X)=   fXq(r)dr 
Jo 

with  q(r)  a  nonnegative-definite  symmetric  matrix.  By  L'Hospital's  rule, 

Um^>  =  ,(0).  (17) 

Since  £(A)  is  positive-definite  it  can  be  written  in  the  form 

£(A)  =  P(X)P(X)' 

with  P(X)  =  U (X)A(X)1  /2U (X)' ,  where  A(A)  is  the  diagonal  matrix  of  eigenvalues  and 
U(X)  the  orthogonal  matrix  of  column  eigenvectors  of  S(A).  Then  the  process 


Z(X)  =   I    P(r)  dW(r), 
Jo 


where  W  is  the  standard  p-dimensional  Brownian  motion,  has  the  same  distribution  as 
R(X).  See  chapter  5  of  Arnold  (1974).  With  probability  one  (w.p.l),  the  set  of  limit  points 
of  the  net 

Z(A) 


y2Aloglog(i)Je.1>Aio 
is  equal  to  the  closed  ellipsoid 

E0  =  P(0)HP  ={xem.p:  x  =  P(0)t/,  y  €  Hv) 
where 

Hp  =  {x£W  :   |x|<  1}. 

For  a  proof  see  Arnold  (1971).  This  result  together  with  (17)  imply  that  w.p.l 

..  fl(A)'Q(A)-'fl(A) 

bm  sup -7- 

aV       21oglog(i) 

.  umsup  ,  m    my1    m 

l\\    x    J 


a-^o     ^Aloglog^)  V    A    /       y2Aloglog(i) 

=     max  7,, 
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where  7,  are  the  eigenvalues  of  P(0)q(0)   1P(0)'.  Then  w.p.l 

limsupi2(A)'g(A)-1i?(A)  =  oo.  (18) 

A— 0 

Similarly  it  can  be  shown  that  w.p.l 

limsup  [R(l)  -  /2(A)]'  [Q(l)  -  Q(A)]"1  [R(l)  -  R{\))  =  00.  (19) 

A— 1 

So  for  any  constant  a, 

Urn  P{  sup  MT([TX])  <  a} 
T_>0°      Ae[o,i] 

<     limsup  lim   P{     sup      Mj([TA])  <  a} 
*•— 0     T— °°       A€[iM-*] 

=     lim  sup  P{     sup      M(A)  <  a} 

t—O  A€[jr,l-7r] 

=     0. 

The  first  equality  follows  from  (16)  and  the  second  from  (18)  and  (19).  This  implies  that 
suPA€[o,  1]  Mt([T\])  —*  00.  But,  since  supA£rA  j,Mt{[TX])  =  Op(\)  for  0  <  A  <  A  <  1, 
then  AT  -?->  {0,  1}  as  T  -»  00  when  A  =  0  and  A  =  1.     □ 

Proof  of  Theorem  4.1:  From  (7)  it  is  easy  to  see  that  (10)  and  (11)  hold  with  c*  =  yt 
in  (11).  In  (10)  substitute  Mt([Tu])  by  T~a My([Tu]).  This  does  not  change  A^  for  any 
T  since  a  does  not  depend  on  u.  From  (11)  we  have 

T-aMT{k) 


=  r-°  (£>,*'«)( 2>«s)~  (£»«*) 
+  r-«(  f;  y,x't)(  £  I(x;)-(  f;  «*) 


t=l  t=l  t=l 

+  (  f;  ylX'tD?'2T-°ii)(D?>2  J:  x,x;V2)-'(r-^"2  £  **)■ 

t=Jfc+l  t=Jt+l  t=k+\ 

From  [Al]  and  [A3']  and  applying  the  continuous  mapping  theorem  the  following  holds 
for  0  <  A  <  1: 

T-QMT([TX])    =>    G(A),g(A)"1G*(A) 

+  [G(l)  -  G(X)]'[Q(1)  -  Q(A)]"1  [G(l)  -  G(X)]. 
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The  proof  is  completed  by  another  application  of  the  continuous  mapping  theorem. 
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